Chinese Information Retrieval Using Lemur: NTCIR-5 CIR Experiments at UNT

نویسندگان

  • Jiangping Chen
  • Rowena Li
  • Fei Li
چکیده

This paper describes our participation in NTCIR-5 Chinese Information Retrieval (IR) evaluation. The main purpose is to evaluate Lemur, a freely available information retrieval toolkit. Our results showed that Lemur could provide above average performance on most of the runs. We also compared manual queries vs. automatic queries for Chinese IR. The results show that manually generated queries did not have much effect on IR performance. More analysis will be carried out to discover causes behind hard topics and ways to improve the overall retrieval performance.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Chinese QA and CLQA: NTCIR-5 QA Experiments at UNT

This paper describes our participation in the NTCIR-5 CLQA task. Three runs were officially submitted for three subtasks: Chinese Question Answering, English-Chinese Question Answering, and Chinese-English Question Answering. We expanded our TREC experimental QA system EagleQA this year to include Chinese QA and Cross-Language QA capabilities. Various information retrieval and natural language ...

متن کامل

LCC-DCU C-C Question Answering Task at NTCIR-5

This paper describes the work for our participation in the NTCIR-5 Chinese to Chinese Question Answering task. Our strategy is based on the “Retrieval plus Extraction” approach. We first retrieve relevant documents, then retrieve short passages from the above documents, and finally extract named entity answers from the most relevant passages. For question type identification, we use simple heur...

متن کامل

UNT 2005 TREC QA Participation: Using Lemur as IR Search Engine

This paper reports our TREC 2005 QA participation. Our QA system EagleQA developed last year was expanded and modified for this year’s QA experiments. Particularly, we used Lemur 4.1 (http://www.lemurproject.org/) as the Information Retrieval (IR) Engine this year to find documents that may contain answers for the test questions from the document collection. Our result shows Lemur did a reasona...

متن کامل

KECIR: An Information Retrieval System for IR4QA Task

― 107 ― KECIR An Information Retrieval System for IR4QA Task Dongfeng Cai, Shengqiao Kang, Yu Bai, Peiyan Wang Knowledge Engineering Research Center, Shenyang Institute of Aeronautical Engineering [email protected] Abstract This paper describes our work on the subtask of simplified Chinese monolingual information retrieval for question answering system at ntcir-8. We use the lemur toolkit to...

متن کامل

How Similar are Chinese and Japanese for Cross-Language Information Retrieval?

For NTCIR Workshop 5 UC Berkeley participated in the bilingual task of the CLIR track. Our focus was on Chinese topic searches against the Japanese News document collection, and on Japanese topic search against the Chinese News Document Collection. Extending our work of NTCIR 4 workshop, we performed search experiments to segment and use Chinese search topics directly as if they were Japanese t...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2005